Anomalous Diffusion on Random Graphs 
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We show that anomalous diffusion can result when the steps of a random walk are not statistically indepen- 
dent. We present an algorithm that counts all the possible paths of particles diffusing on random graphs with 
arbitrary degree distribution. Using this to calculate the mean square displacement, we show that in sharp con- 
trast to continua, random walks on random graphs can exhibit anomalous behavior and yet have well-defined 
and predictable properties. 
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It is well known that a random walk on a continuous 
medium leads to a mean square displacement (x 2 ^ that is lin- 
ear in time as long as there are no infinitely large steps and all 
steps are statistically independent. Levy showed that anoma- 
lous diffusion results when the former assumption is violated. 
In this paper we show that anomalous diffusion results when 
the latter assumption is violated but the former remains, by 
considering random walks on random graphs. 

Random graphs have been very successfully used to de- 
scribe many diverse systems. For example, the spreading of 
diseases through a population has been modelled where peo- 
ple are the vertices and contact between them the edges [|IJ ||]. 
The internet can be thought of as a graph with web sites as ver- 
tices and links as edges, leading to estimates of the number of 
clicks to surf between any two random sites and other quanti- 
ties of interest [0,0]. In general, any system with interacting 
parts, which encompasses a vast array of diverse systems, can 
be mapped onto a graph [|[. 

Along with to statics, dynamic properties of random graphs 
are of interest for many cases like protein folding ^ and 
glassy relaxation [Q]. In this paper we consider particles 
diffusing along the edges of a random graph and develop a 
method for calculating diffusion on a random graph using an 
algorithm that counts all the paths. 

The following is a brief introduction to graph theory. For 







FIG. 1 : A simple graph. The circles represent vertices and the lines 
represent edges. 



a more comprehensive overview see, for example, Godsil and 
Royle [^[|. Graphs consist of two sets: vertices, which are 
points, and edges, which join two vertices. In more mathe- 
matical terms, define for some index set I, the vertex set 



V = {vi | i e I}. 



(1) 



Then, define the edge set as a subset of the pairs of all ver- 
tices 



£ = {(v;,VjO|v;,V*eV}CVxV. 



(2) 



These two sets can be thought of as points and lines in a 
plane as in Figure [j]. The pairs of vertices defining an edge 
are called endpoints of the edge. There are no real restrictions 
on the sets of vertices and edges, making graphs very general 
constructions. 

Like any field, graph theory has its own lingo so here are 
some definitions. The degree of a vertex is the number of 
edges with at least one endpoint at the vertex. The degree 
distribution p(k) is the probability that a randomly selected 
vertex will have degree k. Two vertices are adjacent if there 
is an edge joining them, i.e., in Figure [j], vertices 1 and 4 are 
adjacent but 1 and 2 are not. A path or walk on a graph is a 
set of edges connecting two vertices. For example, in Figure [j] 
paths from 1 to 2 would be the the set of edges {(1,4), (4,2)} 
or {(1,4), (4,3), (3,4), (4,2)}. 

Now, we have enough definitions to consider why it is pos- 
sible for a random graph to violate the assumption that all 
steps of a walk are independent. Consider the graph in Figure 
[j]. There are three vertices with one edge and one vertex with 
three edges. Thus, the degree distribution is 



p(k) 



| k=l 
i k = 3 
otherwise. 



(3) 



Next, put a walker on that graph that can move from vertex 
to vertex along the available edges. For a walker's first step 
there is a probability p(k) that it will have k choices. How- 
ever, for the second step this is not necessarily the case. Still 
looking at the graph in Figure |IJ a walker starting at vertex 1 , 
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2, or 3 will end up at vertex 4 after one step where it will have 
three choices of edges on which to leave. A walker starting at 
vertex 4 can go to 1, 2, or 3 where it will have one choice of 
edges on which to leave. Thus, there are a total of six nearest 
neighbors, one each from vertices 1, 2, and 3 and three from 
vertex 4, and the degree distribution for them, q(k), is 



q(k) 




(4) 




Thus, since p(k) ^ q(k) the degree distribution is a function 
of the number of steps taken. This means that the number of 
choices available to a walker on a graph will depend on how 
far it has gone. Therefore, the assumption that all steps are 
uncorrelated is violated in this graph, and it can be violated 
generally in graphs. Note that it is not necessarily violated; for 
example, in a graph where all vertices have the same degree, 
called a regular graph, p(k) and q(k) are the same. 

Moving on, some more tools for dealing with graphs will 
be introduced. First, define d(ji,v) to be the distance between 
two vertices fx and v which is the size of the smallest set of 
edges making a path starting at /j and ending at v. For example 
in Figure [j], d(l,2) = 2 and , 4) = 1. Unless otherwise 
noted, all distances mentioned henceforth refer to this graph 
distance. 

Next, define the adjacency matrix A of the graph G = (V,E) 
to be a square \V\ x \V\ matrix with A^v equal to the number of 
edges joining the vertices p and v. Note that so long as there 
are no edges that connect a vertex to itself, i.e. loops, the 
diagonal elements of A must be zero, and an edge connecting 
vertex ptov also connects v to p so A is symmetric. 

As an example, consider the graph in Figure |l] which has 
adjacency matrix 



/0 1\ 

1 

1 

\i i i oy 



(5) 



The adjacency matrix of a graph can be used to count all 
paths on the graph of a given length. By definition the ele- 
ment A A , V gives the number of paths of length one from vertex 
p to v. It can be shown by induction that (A")^ v gives the 
exact number of paths of length n from ptov [^] . For exam- 
ple, looking at the graph in Figure [j], the number of paths of 
length exactly two between any two vertices are given by the 
elements of 



FIG. 2: A regular ring graph. As the size grows, a walk on this graph 
approaches one dimensional diffusion, i.e. the drunkard's walk. 



The reader is encouraged to verify this by inspection. Note 
that edges can be traversed more than once and are multidi- 
rectional. 

Consider the regular graph in Figure [| A random walk 
on this graph is just the well-known drunkard's walk with 
periodic boundary conditions which is not anomalous. Let 
(x 2 («)) denote the average distance squared as a function of 
n, where n is the number of steps taken and x is the distance 
measured from some starting vertex on the graph. One way to 
calculate (x 2 {n)) is to choose some starting vertex and count 
the fraction of paths of length n that are only a distance m < n 
away from the start, and then average over all possible start- 
ing vertices. Call this quantity p" n . Looking at the drunkard's 
walk graph, Figure ||, choose any vertex as the start since they 
are all equivalent. Then, there are two paths of length 1, one 
step clockwise and one step counterclockwise, both of which 
are distance 1 away from the start, so pj n = S m i . There are 
four paths of length 2 of which two are one step out and one 
step back, giving a distance of 0, and two are two steps out 
clockwise or counterclockwise, giving distance 2. So p\ is 
given by 



I m = 2 
otherwise. 



(7) 



In general, as the number of steps increases p n m will con- 
verge to a binomial distribution leading to the standard result 



(x 2 (n))=£m 2 p" n 



Dn, 



(8) 



/l 1 1 0\ 
1110 
1110 

\0 3/ 



(6) 



where D is the diffusion coefficient [bp. 

The central limit theorem says that changing the step size 
of the above walk only changes the diffusion coefficient, not 
the linear dependence on n. Thus, if the dependence on n is 
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FIG. 3: (j(?(n)) as calculated with adjacency matrices for walks on 
a one dimensional ring with 1000 vertices and for a two dimensional 
square lattice of size 40x40 with periodic boundary conditions. Note 
that (x 2 (ii)\ is linear in both cases as expected, but the slope changes 
with the details of the system. The lines are guides to the eye. 
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FIG. 4: A sample degree distribution generated from a random graph 
with 5000 edges and 2000 vertices as described in the text. The line 
is a Poisson distribution with the same mean value as the data. 



the only quantity of interest, graphs can be used to calculate it 
even though they ignore all spatial distances and only concen- 
trate on the number of choices available at each step. 

Using an approach similar to the above example, the adja- 
cency matrix can be used to calculate p", and hence the de- 
pendence of (x 2 ) on n for any general graph. Since the adja- 
cency matrix A can be used to calculate the exact number of 
paths of a given length between any two vertices, we can enu- 
merate all paths on the graph and find p n m using the following 
algorithm: 

1 . Let n be the current path length, starting with 1 . 

2. Raise A to the n th power. 

3. Find the number of paths between any two vertices by 
looking at the elements of A" . 

4. Keep track of the minimum distance between any two 
vertices by noting if this is the smallest n such that a 
path exists. 

5. For every m <n find p" n . 

6. Increment n and repeat. 

Once we have the distribution p" n , all the random walk prop- 
erties can be calculated. In particular, equation [| allows the 
calculation of (x 2, (n)\. 

The method presented here for enumerating paths to cal- 
culate random walk properties reduces to just a problem of 
matrix multiplication. Thus, it lends itself very well to cal- 
culation by computer where efficient and easy to use matrix 
multiplication algorithms are readily available. 

As a first test of this method, diffusion on a ring should be 
equivalent to the drunkard's walk for large enough rings. The 
same should be true for any graph representing a regular lat- 
tice. Figure U shows a calculation of (x 2 ^)) using adjacency 
matrices as described above for a ring graph with 1000 ver- 
tices and a periodic two dimensional square lattice of size 40 
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FIG. 5: A sample plot of (x 2 (w)\ for a graph with degree distribution 
with a mean of 4. (x 2 (rc)) for ballistic walks and Brownian walks are 
included for reference. The lines are guides to the eye. 



by 40. From the plot it is apparent that this adjacency matrix 
algorithm gives the correct exponent of 1, i.e. (x 2 («)) ~ n. 
The slope of the lines are the diffusion coefficients which do 
depend on the particular system. 

Now, consider some graphs which are not regular and, 
hence, can produce anomalous diffusion. One way to generate 
such graphs is to begin with a set of vertices. Next, go through 
all the vertices once and add edges where one endpoint of the 
edge is on the current vertex and the other goes to a random 
vertex. This guarantees that no vertices are isolated. Then, 
randomly connect pairs of vertices until the desired number 
of edges are laid down. The probability that a random vertex 
has k edges is given by 



p(k-l) 



V-l 



A' 



V-l 



E-k 



(9) 



where E is the number of edges, and V is the number of 
vertices and p = 2E/V is the average number of edge end- 
points per vertex. As the number of edges increases, p{k) in 
equation approaches a Poisson distribution with mean p, i.e. 
p(k) (p k /k\)e-f. See Figure § 

Once the random graph is constructed, apply the path 
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FIG. 6: A plot of the diffusion exponent versus the average degree 
for random graphs with 1800 vertices and 2000 vertices. Each are 
averaged over three separate realizations. Note that the anomalous 
exponent can be controlled by the average degree. Errors are from 
fitting (x 2 (rc)) to find the exponent. The lines are guides to the eye. 



counting algorithm and calculate (x 2 {n)). As an example, the 
result of this calculation for a graph with average degree 4 is 
shown in Figure [|. For comparison the values of (x 2 («)) are 
included for a ballistic walk, (x 2 (n)^ ~ n 2 , and a Brownian 
walk (x 2 (n)) ~ n. As can be seen in Figure |J the value of 
(x 2 («)) is somewhere between a Brownian walk and a ballis- 
tic walk so it is anomalous as expected. 

For a large enough number of steps, (x 2 («)) of a random 
graph saturates because of a finite size effect. For finite ran- 
dom graphs such as these, there exists a finite average dis- 
tance Dmax between any two vertices which scales like the 
log of the number of vertices [|To||. This means that once the 
walker has moved a distance equal to D max , every new vertex 
to which the walker moves is still only on average a distance 
Dmax away from the starting vertex. So (x 2 («)} approaches a 
constant as the number of steps approaches D max . Below this 
plateau (x 2 (n)} follows a power law < x 2 >^ n a . Figure ^ 
shows a plot of the exponent a from a power law fit for graphs 
with various average degrees. As can be seen from the plot, 
the anomalous diffusion exponent is controlled by the average 
degree. Also, as the average degree increases, the degree dis- 
tribution approaches a Poisson distribution. At the same time, 
the power law for (x 2 (n)) approaches 1.6. Thus, the exponent 
for a highly connected random graph with a Poisson degree 
distribution is 1.6. 

A problem with modelling random walks with graphs is that 
the graph distance does not necessarily correspond to any real 
spatial distance. For clarification, imagine an embedding of 
a graph into a flat surface, i.e. random dots connected with 
lines on a piece of paper. Then, one can calculate (x 2 {n)) 
directly by just following a large number of walkers and aver- 
aging their Euclidean distance on the surface measured from 
where they started as a function of time. Next, repeat this pro- 
cess for a new embedding of the same graph with different 
distances between the dots but the same connecting lines. It 
seems reasonable that this disorder averaging will lead to a 



well-defined value for the exponent. This can then be com- 
pared to the results from the adjacency matrix algorithm us- 
ing the graph distance. The hope is that since both give well- 
defined power laws for the displacement squared, these two 
measurements will lead to the same power law, possibly with 
a different coefficient but with the same exponent. To justify 
this rigorously, one would have to show that the specific dis- 
tribution of step sizes in the Euclidean embedding does not af- 
fect the diffusion exponent. In other words, graphs only keep 
track of the number of choices a walker has at each step, not 
the spatial distance it can go. So for the exponent calculated 
with the adjacency matrix algorithm to give the same results 
as a random walk in real space, the number of choices must 
be the only thing that matters. 

One verification that the exponent is insensitive to the de- 
tails of the spatial distances comes from our numerical cal- 
culation of the exponent for different realizations of random 
graphs with a given degree distribution. We find that the ex- 
ponents obtained from these agree with each other within the 
error bars of the fit. On the other hand, the exponent changes 
when the degree distribution is changed. This means that it 
is the degree distribution, the distribution of the number of 
choices a walker has at each step, not the specific realization 
of the random graph which determines the exponent. 

In conclusion, random walks on random graphs exhibit 
characteristics of anomalous diffusion which can be con- 
trolled by the degree distribution of the graph. Since this is 
drastically different from what one finds for a continuum, it 
seems that continua and random graphs are fundamentally dif- 
ferent structures. Therefore, when restrictions are placed upon 
the ability of diffusing particles to sample their surroundings, 
anomalous behavior arises. 
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